Transition Initiation Sites (TIS) Recognition in DNA Sequence using Machine Learning
نویسنده
چکیده
Transition Initiation Sites (TIS) prediction is a challenging problem in computational biology. In the literature TIS is predicted using various machine learning techniques such as Neural Network (NN), Support Vector Machine, etc. We have applied Principal Component Analysis (PCA) to remove highly correlated features which improves the performance in terms of time and accuracy. In this paper we have used Group Model of Data Handling (GMDH) based algorithm Abductive Network (AN) to predict TIS and got accuracy of 93%. KeywordsBioinformatics, Transition Initiation Sites (TIS), mRNA sequence, Machine Learning, Neural Network, Abductive Network, GMDH.
منابع مشابه
Prediction of Eukaryotic Translation Initiation Sites Using Machine Learning
The computational identification of translation initiation sites (TIS) is a major component of every gene prediction system, and is thus of major importance in genome annotation projects. A large number of machine learning methods have been described to identify TIS in transcripts such as mRNA, EST and cDNA sequences. In this regard, most of the prediction methods have focused on recognizing TI...
متن کاملEngineering support vector machine kernels that recognize translation initiation sites
Motivation: In order to extract protein sequences from nucleotide sequences, it is an important step to recognize points at which regions start that code for proteins. These points are called translation initiation sites (TIS). Results: The task of finding TIS can be modeled as a classification problem. We demonstrate the applicability of support vector machines for this task, and show how to i...
متن کاملEngineering Support Vector Machine Kerneis That Recognize Translation Initialion Sites
MOTIVATION In order to extract protein sequences from nucleotide sequences, it is an important step to recognize points at which regions start that code for proteins. These points are called translation initiation sites (TIS). RESULTS The task of finding TIS can be modeled as a classification problem. We demonstrate the applicability of support vector machines for this task, and show how to i...
متن کاملQuantitative analysis of mammalian translation initiation sites by FACS-seq
An approach combining fluorescence-activated cell sorting and high-throughput DNA sequencing (FACS-seq) was employed to determine the efficiency of start codon recognition for all possible translation initiation sites (TIS) utilizing AUG start codons. Using FACS-seq, we measured translation from a genetic reporter library representing all 65,536 possible TIS sequences spanning the -6 to +5 posi...
متن کاملInitiation of mtDNA transcription is followed by pausing, and diverges across human cell types and during evolution.
Mitochondrial DNA (mtDNA) genes are long known to be cotranscribed in polycistrones, yet it remains impossible to study nascent mtDNA transcripts quantitatively in vivo using existing tools. To this end, we used deep sequencing (GRO-seq and PRO-seq) and analyzed nascent mtDNA-encoded RNA transcripts in diverse human cell lines and metazoan organisms. Surprisingly, accurate detection of human mt...
متن کامل